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Abstract 

After the recent discovery of a 125 GeV Higgs-like particle at the Large Hadron Cohider (LHC), 
it is crucial to examine its role in unitarizing high energy WlWl scattering, which may reveal its 



Oh 



K*" possible deviation from a Standard Model Higgs. We perform an updated study on WW scattering 

in 

^^ 20% larger than that based on the conventionally "gold-plated" dileptonic channel, while 200% 
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in the semileptonic channel at the LHC, improved by the recently developed W jet tagging method. 
The resultant statistical significance of a Strongly-Interacting Light Higgs (SILH) model is about 



more signal events are retained. This allows the discovery of an SILH model if the signal strength 
is 100% (10%) of a pure Higgsless model, using about 40 (3000) fb"^ data at the 14 TeV LHC. 
Meanwhile, the excellent sensitivity to the anomalous Higgs- VF boson coupling makes semileptonic 
WW scattering an important complement to precision measurements at the Higgs resonance. 



1. INTRODUCTION 

Longitudinal WW scattering at high energies directly probes the mechanism of elec- 
troweak symmetry breaking (EWSB). In particular, with unitarity as a guideline this pro- 
cess has been seen as a promising place to search for new physics if EWSB involves strong 
dynamics such as in the Higgsless models. The recent exciting discovery of a Higgs-like 
particle at the LHC [Ij does not eliminate the importance of WlWl scattering. This is in 
part because the measurements [2] of the cross sections of the Higgs-like particle still bear 
significant uncertainties as for now, which allows the possibility that the observed resonance 
is a Higgs imposter or a dilaton/radion [3] , while EWSB follows the pattern of a Higgsless 
model. Of course a more naturally favored scenario in light of the recent data is that the 
observed new particle is indeed mostly a Standard Model (SM) Higgs boson [4J. In this case 
it is important and intriguing to explore possible deviations in Higgs properties from the 
SM predictions, which can originate from an extended Higgs sector or other new physics 
connected to the Higgs. WlWl scattering is an important place to look for such deviations, 
since a key role of the SM Higgs is to unitarize this process at high energies. 

Most of the efforts exploring beyond-the-SM properties of the Higgs-like new particle have 
been focusing on extracting on-shell couplings from measurements at the Higgs resonance. 
Another prospect not as well stressed is that: processes involving an intermediate off-shell 
Higgs may provide complementary information to what we can learn from the on-shell 
measurements. WlWl scattering as the focus of this paper, can be seen as such an example. 
First, in regard to measuring the Higgs- VT boson coupling, unlike precision measurements at 
the Higgs pole, WlWl scattering is insensitive to uncertainties from invisible or exotic Higgs 
decays. Furthermore, with enough statistics WlWl scattering may resolve different models 
that can be nearly degenerate at the Higgs resonance. For instance, a two-Higgs-doublet 
model and a composite Higgs model [5J both may include a 125 GeV scalar with a coupling 
to the W boson deviated from that of the SM Higgs by a similar amount. On the other 
hand, the two models may have very different behaviors in high energy WlWl scattering 
which captures the sum-over effect from all intermediate states responsible for EWSB. 

As a case study, we explore a generic class of composite Higgs models, the Strongly 
Interacting Light Higgs (SILH) models |5j. The signal of such models is defined as the event 
excess in WlWl scattering over the SM prediction with a light Higgs. We investigate the 
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LHC sensitivity to the excess in the semileptonic decay channel, using the recently developed 
IV-jet tagging technique [H]. The signal of the SILH model can be well approximated by that 
of the Higgsless model, scaled down by a factor ~ {cuiY, which quantifies the deviations of 
the Higgs-weak boson couplings from the SM |5l [7] . This allows us to focus on the limiting 
case of the Higgsless model and then infer the sensitivities for the SILH models using the 
scaling relation. The Higgsless model signal is obtained by turning off diagrams involving 
the Higgs boson in the SM, without introducing heavy WW resonances. The existence 
of such resonances is implied by the breakdown of perturbative unitarity, which typically 
enhances the signal strength. Therefore, the sensitivities derived in our study can be seen as 
a conservative estimate. Meanwhile, we emphasize that if these resonances weakly couples 
to the fermions, it is essential to search for them in the WW scattering signal. 

Compared to the dileptonic channel, the well-known challenge for the semileptonic WlWl 
scattering is the contamination from QCD backgrounds. However, the semileptonic channel 
is still appealing because it yields much more signal events and enables reconstruction of the 
W momenta and thus important kinematics such as the WW invariant mass. Central jet veto 
and tagging jets requirements have been proposed to reject major QCD backgrounds such 
as from tt+jets, and boost the longitudinal fraction of the W^s |8J. A persistent background 
is VT+jets where the W decays leptonically. In a signal event, the hadronically decaying W 
is highly boosted at high energies, thus behaves as a single jet in a collider detector, which 
we call a W jet. In order to reject the VT+jets background, it is essential to distinguish a W 
jet from a QCD jet initiated from a quark or a gluon, which has been a focus of the recent 
jet substructure studies (see [9] for a review). Nevertheless, the latest work before ours on 
WW scattering using jet substructure dates back more than a decade ago [10], where a 
prototype of the filtering/mass drop method fTT] was applied to reject QCD backgrounds. 
In this work we apply the more advanced multivariate W jet tagging method proposed in 
an earlier paper [6j. We find the resultant sensitivity to the beyond-the-SM signal in W^Wl 
scattering is better than those in the literature, including those based on the dileptonic 
channel [121 US]- Our results also suggest that, at the high luminosity 14 TeV run of the 
LHC, WlWl scattering has a good sensitivity to the anomalous hWW coupling. 

The rest of the paper is organized as follows. In Section [2] we give definitions of the signal 
and the backgrounds, and demonstrate at parton level how to suppress the electroweak 
backgrounds which includes the WlWt and WtWt components of WW scattering. In 



Section [3] we present results from a jet level analysis using the W jet tagging method, 
taking into account all major backgrounds to the semileptonic WlWl scattering. Finally 
we conclude in Section HI 

2. WW SCATTERING AT PARTON LEVEL 

A. Signal and Backgrounds Definitions 

At a hadron collider such as the LHC, a WW scattering event is characterized by a PF 
pair produced with a pair of forward and backward tagging jets. Following the "subtraction" 
approach proposed in Refs. [8|, we define the new physics signal as the event excess in WW 
scattering over the SM prediction with a light Higgs {rrih ~ 125 GeV): 

S = a{pp -^ JjWl^)ncw physics " C^{PP ^ jjWW)sM- (l) 

As mentioned in the introduction, although a 125 GeV Higgs-like particle has been dis- 
covered at the LHC, a sizable excess in WW scattering can still emerge if the new particle 
has non-SM couplings to W and Z, and cannot fully unitarize WlWl scattering. This hap- 
pens in a generic class of composite Higgs models, namely, the Strongly Interacting Light 
Higgs (SILH) [5j models. In particular, the Lagrangian in these models includes the follow- 
ing dimension-6 operator, which affects WlWl scattering at ^/sww ^ f^hi where ^/sww is 
the invariant mass of the W pair: 

C D ^^d^{H^H)d,{H^H), (2) 

where / is the characteristic scale of the new physics. This operator results in a modified 



Higgs-gauge coupling: g^s = gsAi/V^ + ch^ ^ (1 - CH^/2)gsM, where the approximation 
holds for small chC,, C, = v^ / p {v = 246 GeV is the Higgs vev) and ch ~ 0(1— 47r) depending 
on the underlying theory. Notice that the cross section of WlWl — )■ WlWl at ^/sww ^ ^/i 
is related to the Higgsless model {gcs = 0) by [5]: 

(TsiMWlWl ^ WlWl) ~ (c^O'^Higgsiess(W^LW^L ^ WlWl), (3) 

where the Higgsless model cross section can be obtained by fixing v = 246 GeV and taking 
the Higgs mass to infinity. Given the observation that the newly discovered particle has 



Higgs-like couplings to the gauge bosons, a pure Higgsless assumption may not be realistic. 
However, it is convenient to take the Higgsless limit as one of our bench mark points to 
evaluate the performance of our method. 

As demonstrated in Ref. [13], unlike WlWl scattering, the cross sections of WtWt and 
WlWt productions are largely independent of the Higgs mass, and almost the same in the 
Higgsless model and the SM. Moreover, the cross sections for polarization flipping processes 
are negligibly small (about 3 orders of magnitude smaller than polarization conserving ones 
[13]). Therefore, to a good approximation, except for W^Wl — ?• WlWl, other contributions 
to the signal as defined in Eq. ([I]), are cancelled out. Our signal definition for the SILH 
models can then be rewritten as 

>S'siLH = Cr{pp^ jjWLWL)siLR-Cr{pp -^ JJWlWl)sM 

^ a{pp ^ jjWlWl)silr- (4) 

In the second line of Eq. Q, we have ignored the SM contribution to WlWl production, 
which is valid only when we require forward and backward tagging jets to select events 
produced from WW scattering, and large WW invariant mass to ensure crsiLuiWLWL — )■ 
WlWl) > (Tsm{WlWl -^ WlWl)- In this case, the SILH signal is related to the Higgsless 
signal by a simple rescaling (using Eq. (|3|)): 



SsILh{chO ~ {chO"^ Smggsless- (5) 

The signal definition and the scaling in ch^ are illustrated in Fig. [TJ 

In light of this simple scaling relation we will focus on analyzing the LHC sensitivity for 
the Higgsless assumption in the following discussions. Then we use Eq. rtsl) to estimate the 
sensitivity of the SILH case parametrized by chC Strictly speaking, in the SILH models 
Eq. ^ only holds when ch^ < 1 (but not too small to make the SM WlWl -^ WlWl 
non-negligible). Here, we simply take {chC]"^ as a measure of the signal strength and use the 
scaling relation in Eq. d5| even for larger {chO"^ values. Finally, because of the quadratic 
growth of the scattering amplitude in ^/swwJ perturbative unitarity will break down at 
sww ~ IGtiv^/ch^ [7j for the SILH models and a new resonance(s) is needed to restore 
unitarity. Therefore, a pure Higgsless/SILH assumption, as we will adopt, is unphysical 
beyond the break-down scale. However, since the properties of such a resonance(s) are 
model dependent and our focus is on the impact of the search techniques, we stick to the 
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FIG. 1: VFWjj signals as a function oiniy/m: before (left) and after (right) ap^ > 350 GeV cut on 
both VF's. Only electroweak processes are included in the plots. The blue solid curve denotes the 
PFWjj cross section in the SM with a light Higgs {Mh = 125 GeV). The red solid curve is for the 
Higgsless model, while the dotted and dashed lines are for SILH with (chS,)'^ = 0.5, {chO'^ = 0.2, 
respectively. The signal (light red area) is defined as the difference between the Higgsless/SILH 
and the SM. Tagging jets are required (Eq. ([6])). 



non-resonance assumption. In general this results in a conservative estimate for the signal 
significance, compared with the case when a resonance(s) is included. It also allows us to 
compare with previous studies in the literature under the same assumption, such as Ref. [T^ 

Now we move on to consider relevant backgrounds. Since we are interested in the excess 
over the SM, the SM WW scattering events become an irreducible background. These 
events come from the diagrams involving only electroweak couplings^, which are present 
regardless whether the W^s decay hadronically or leptonically. We will call these backgrounds 
EW backgrounds. Moreover, as mentioned, in this article we focus on exploring the WW 
scattering signal in the semileptonic decay channel. Therefore inevitably, we also need to 
consider processes involving QCD couplings, dominated by tt+jets and ly+jets. As we will 
show in the next section, by using our multivariate W jet tagging method, we are able to 
reduce the QCD backgrounds to a similar level as the EW backgrounds and the final signal 



^ Strictly speaking, these diagrams cannot be completely separated from those involving QCD couplings 
with the same final state particles due to interferences. However, the interferences are tiny in the phase 
space we are interested in |13) . 



sensitivity becomes better than the dileptonic channel. In the next subsection, we will first 
consider how to reduce the EW backgrounds by examining their differences from the signal 
at the parton level. In Section [3} we will include all backgrounds at the jet level and obtain 
our final results. 

B. Suppressing the electroweak backgrounds — parton level 

We generate pp — )■ WWjj — )■ Ivjjjj events with MadGraph 5 [15] at the parton level, for 
the SM with a 125 GeV Higgs, and for the Higgsless model by turning off the SM diagrams 
containing the Higgs boson. All charge combinations for the W pair are included. In this 
section, we only consider backgrounds with pure electroweak couplings, which are present 
regardless how W^s decay, while postponing the discussions on QCD backgrounds to the 
next section. For illustration, we discuss the Monte Carlo truth at the parton level, which 
can be straightforwardly applied to the jet level analysis. 

At the generation level, the following cuts are applied on the two tagging jets to improve 
the efficiency: 

|r/j|<5, y^ > 10 GeV, E^ > 150 GeV, M^j > 300 GeV. (6) 

At the analysis stage we further employ the following selection criteria to enhance the 
signal-background ratio. 

• Two tagging jets ji, J2 with 

2 < |r/| < 5, pt > 25 GeV, E > 340 GeV and r/j, ■ r/j^ < 0. (7) 

• p^ > 350 GeV for both W's. 

• The two partons from the W decay have a pt ratio > 0.1 (lower /higher), for both 

W's. 

• mww > 850 GeV. 

We summarize the signal and background cross sections after each cut in Table [l] Note that 
we have chosen the cuts to be similar to the jet level cuts we use in Section [3| although 
other values of the cuts may also suffice for the illustration here. 





Initial a (fb) 


tagging jets 


p^ > 350 GeV 


Pt ratio > 0.1 


mww > 850 GeV 


Higgsless 


252 


54.9 


2.15 


1.84 


1.74 


SMEW 


236 


48.8 


0.54 


0.37 


0.34 


Signal (fiHiggsiess - (^sm) 


16.0 


6.07 


1.61 


1.47 


1.40 


S/B 


0.068 


0.124 


2.97 


3.97 


4.13 



TABLE 1: Parton level cross sections of the signal and the SM electroweak backgrounds (in fb) in 
the semileptonic channel, and the signal-background ratio after each cut. 

The tagging jet cuts are used to select the events from WW scattering. After the tagging 
jet cuts, we expect the signal-background ratio to increase at higher WW invariant mass 



due to the 



'WW 



growth in the signal WlWl — )■ W^Wl cross section. However, from 



Fig. [T] we also see even at 1 TeV, the remaining events are still dominated by the SM 
backgrounds, which are mainly from WtWt and WtWl productions. For example, if we 
apply an Mww > 850 GeV cut directly after the tagging jet cuts, we get a signal-background 
ratio of 0.49. 

To further reduce the background, we utilize the observation [13] that the angular dis- 
tribution for the WlWl — )■ W^Wl scattering in the Higgsless model is different from 
WW — )■ WW (including all polarizations) in the SM, namely, with respect to the incoming 
W direction, the scattered VT's are more central in the former case than in the latter. This 
effect is also more significant at higher ^/sww [13]. Since the incoming W's are radiated 
close to the beam line, this results in a larger p^ for the signal than for the background in 
the lab frame. The p^ distributions after the tagging jet cuts are shown in Fig. pi We then 
choose a p^ > 350 GeV cut, and obtain the right-side plot in Fig. [l| The signal-background 
ratio becomes 3.0. For comparison, the corresponding plot before the p^ cut is shown on 
the left side of Fig. [I] After the pj^ cut, the WlWl component is found to be about 4% of 
the 0.54 fb SM EW backgrounds, which is negligibly small compared with the signal. This 
justifies the second line in Eq. Q. 

Our final control over the EW backgrounds comes from the W polarizations: the signal 
only contains longitudinal PF's, while the background contains both longitudinal and trans- 
verse VT's. This results in different angular distributions for the W decay products. For a 
longitudinal W , the decay products tend to move perpendicularly to the W moving direc- 
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FIG. 2: Left: cross sections for the Higgsless model and for the SM electroweak backgrounds after 
tagging jet cuts, as a function of p^; right: the signal-background ratio, (dciHiggsiess — do"sM)/do"sM) 
as a function of j?^. 
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FIG. 3: The ratio of the lower pT to the higher pT for the two partons from W decay. 



tion, in the W rest frame; for a transverse W, one parton tends to move along the W moving 
direction while the other one against it. Therefore, when the W is boosted, the momenta of 
the two decay products tend to be more balanced for a longitudinal W than for a transverse 
W. We see this effect by examining the pt ratio of the two decay products, as shown in 
Fig. [3j The Higgsless model, dominated by longitudinal W^s, has fewer events in the low 
PT-TdXio region. Therefore, we can put a cut on p-p-ratio to further increase S/B, as shown 
in Table [1} In practice, when a highly boosted W decays hadronically, the two partons yield 
two subjets. We then require a more balanced momentum configuration for the two subjets. 



Coincidentally, this requirement is also essential for reducing QCD jet backgrounds since 
QCD jets typically do not contain balanced subjets. 

3. WW SCATTERING WITH JET SUBSTRUCTURE 

The dileptonic decay channel for a W pair is considered "gold-plated" conventionally, 
since it has smaller backgrounds than the semileptonic channel or the all hadronic channel. 
However, it suffers from a smaller branching ratio and thus worse statistics. In addition, the 
presence of two missing neutrinos makes it hard to reconstruct the kinematics. In this article, 
we will concentrate on the semileptonic channel, and show that it has a better discovery 
potential than the dileptonic channel. 

Due to the fact that we are studying WW scattering at high ^/sww around 1 TeV, 
the hadronic W is highly boosted and thus it is important to distinguish it from a QCD 
jet. We will utilize the state-of-art W tagging method as described in Ref. [6]. Here it 
is helpful to first briefly review the physics behind VT-jet tagging and the major results 
in Ref. [6]. There are two major differences between a PF-jet and a QCD jet. Firstly, a 
W jet contains two hard subjets {i.e., subregions where the jet energy is concentrated), 
originated from the two quarks in the W decay, while a QCD jet usually has only one 
hard subjet. Various jet algorithms have been proposed for identifying the hard subjets, 
including filtering [11], pruning [16] and trimming [T7]. Secondly, the W boson is a color 
singlet particle, consequently all QCD radiation from the decay of a boosted W is confined in 
a small cone around the W momentum direction. On the other hand, a QCD jet is initiated 
from a color triplet or octet, which is color-connected to the beam or the other side of the 
event. Therefore, the radiation of a QCD jet is usually much more diffuse. This difference 
is manifest, for example, by examining how the jet mass (or px) grows when the jet radius 
increases from the jet energy center [6]. This is well captured by the R-coie variables defined 
in Ref. [6] . The most powerful discriminant is built upon variables sensitive to the above two 
major differences using a multivariate analysis (MVA), as shown in Ref. [6], where a factor 
of ~ 5 is achieved in the improvement of statistical significance (S/y/B) for px > 200 GeV. 
We give more information of the multivariate W-tagging method in the Appendix, while 
refer readers to Ref. p] for further details. 



A. Event simulation and selection 

As described in Sec. |2j electroweak processes involving WW + 2 jets are simulated with 
Madgraph 5 at the parton level for the Higgsless model and the SM with a light Higgs. 
Minimal cuts in Eq. d6| are applied when generating the events. We then use Pythia 8 [18] 
to add parton showers and hadronization. Besides the electroweak backgrounds discussed 
in Sec. |2} the process also receives backgrounds from ti + jets and W + jets, in which 
QCD couplings are involved. For simplicity we will slightly abuse the nomenclature and 
call these QCD backgrounds although they also involve electroweak couplings. To estimate 
these backgrounds, we simulate the 2 — )■ 2 processes of W^ + 1 jet and ti with Pythia 8 
and have the extra jets generated from parton showers. In addition, in Sec. [2] we only 
included pure electroweak processes for WW + 2 jets, while omitting processes with QCD 
couplings. As discussed in Ref. [T3|, the latter is a minor background once we require two 
forward/backward tagging jets. For completeness, in this Section we include this latter 
background as part of our QCD backgrounds and treat it similar to ti + jets and W^+jets, 
namely, we generate pp — )■ WW events and add parton showers with Pythia 8. As mentioned 
in Ref. [19], the tagging jets in the QCD backgrounds may not be modeled precisely from 
the parton shower approach. A more accurate background estimate may be obtained using 
other tools such as Alpgen pOj or MC@NLO [21j. Because the main purpose of this paper 
is to show the relative improvement in the study of WW scattering by using W jet tagging, 
we stick to the Pythia 8 simulations. 

At the generation level, a. pr > 150 GeV cut is applied for the W^ + 1 jet and WW 
processes and pr > 100 GeV for ti. We assume a 50% b-jet tagging efficiency and veto 
events that contain one or more b-jets. This cuts off 75% ti events. We use the default 
settings of Pythia 8, in which ISR, FSR and the underlying event are included, while pileup 
is not included^. After hadronization, all visible stable particles are grouped in 0.1 x 0.1 
bins on the (77, 0) plane, corresponding to hadronic calorimeter resolutions. 



^ Pileup will affect the performance of our W jet tagging method. However, it is shown in Ref. ^\ that the 
effect can be largely corrected using a subtraction scheme. As an example, Ref. [22] shows the performance 
of top tagging degrades by about 10% in terms of S/\/~B when the number of pileup vertices is 60. It 
remains to be seen whether this type of subtraction performs similarly well when incorporated in W jet 
tagging. Moreover, we note that some of the pileup effect can be avoided by using variables based on the 
tracking information ^23], and requiring the tracks to originate from the primary vertex. 
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We then reconstruct the signal and background events and impose selection cuts with 
the following procedure. At this stage, we keep some of the critical cuts unfixed and later 
will vary them to obtain the optimal results according to the case being studied. 

1. Lepton and missing momentum from W ^ iu 

Isolated leptons are defined similar to Ref. [21]: an isolated muon or electron has to 
pass the cuts: px > 20 GeV and \ri\ < 2.4. The sum of the transverse energies around 
the lepton in a cone AR = y A^^ + Ar/^ < 0.3 must satisfy '^Et < 0.14p^''. The 
sum of the transverse momenta for all tracks around the lepton in a cone R < 0.3 must 
satisfy Xltracks Pt < 0.13p-^^. The energies of all isolated leptons are removed from the 
corresponding calorimeter cells for jet clustering. The isolated lepton with the highest 
Pt is assumed to be the lepton from the leptonic W decay, which is required to satisfy 
p^ > 40 GeV. Furthermore, we apply a cut on the missing transverse momentum: 
^T > 40 GeV, where ^t is defined as the negative sum of the visible momenta, including 
those of the isolated leptons and those deposited in the calorimeter cells. 

2. Neutrino momentum reconstruction 

Assuming the transverse momentum of the neutrino from W —^ iu is given by the 
missing transverse momentum, we use the W mass shell constraint to solve for the 
neutrino momentum in the beam direction, p'^. This method gives us a quadratic 
equation leading to either or 2 real solutions. We discard events without real solu- 
tions. In the case of two real solutions, we keep the solution that gives the smaller 
\p'^\. The reconstructed leptonic W is required to satisfy 

#-^^ > Ptuv (8) 

3. ly jet tagging 

We use the Cambridge /Aachen algorithm in FastJet [23] and cluster the calorimeter 
cells to fat jets with _R = 1.2 . The leading jet is taken to be the W jet candidate 
which is required to pass the filtering/mass drop procedure described in Appendix lAl 
After filtering, the jet mass is required to fall in the window (60, 100) GeV. We take 
the jet momentum after filtering as the W momentum, which is required to pass 
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To further discriminate W jets from QCD jets, we use the W jet tagging method 
described in Ref. [1]. In this method, Boosted Decision Trees (BDT), a muhivariate 
event classifier, is used to discriminate W jets from QCD jets. The signal/background 
efficiency is variable by varying the BDT cuts. In Ref. [6], we trained jets in different 
Pt bins (bin size = 50 GeV) separately, therefore the BDT cuts for different pt bins 
can be chosen independently. In this work, to simplify our procedure, we choose the 
BDT cuts so that the nominal signal efficiency ^f °^ is the same for all px bins. Here 
B^^^ varies from to 1, where 1 means we keep all the signal events that have passed 
the filtering/mass drop procedure. Note that the nominal signal efficiencies in Ref. |S] 
were obtained from the SM WW pair production, where the W^s are dominantly 
transversely polarized. When the W^s are dominantly longitudinal, as in our signal 
events, the actual efficiency is higher. 

4. Central jet veto 

We remove the calorimeter cells constituting the W jet after filtering. The remaining 
calorimeter cells of the fat jet, as well as all other calorimeter cells are reclustered with 
a smaller jet radius, R = 0.4, using the anti-fci algorithm. We veto events with one or 
more central jets that satisfy \ri\ < 2 and px > Pt cut- 

5. Forward/backward tagging jet cuts 

We look for a forward tagging jet with 5 > r] > 2 and a backward jet with —5 < r] < 
—2. The energies of the two tagging jets are required to pass the following cuts: 

TTiforward ^ rpfb Tpbackward ^ rrifb /-i r\\ 

6. Mww cut 

Finally, we apply a cut on the WW invariant mass, 

M\YW > ^WVK,cut- (11) 

This cut is useful, especially for reducing the QCD backgrounds, which drop faster than the 
signal when M^w increases. 
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B. Results 

In the above procedure, we did not specify the values for the following cuts: the pt cuts 
for the leptonic and hadronic W's, which we take to be equal, Pr^uv ^^^ BDT efficiency, 
ef^'^; the pr cut for the central jet, p^cuti ^^^ energy cuts for the forward and backward 
tagging jets, which we take to be equal, E^^^, and the WW invariant mass cut, Mww,cnt- We 
then vary these cuts to achieve the best statistical significance, S/y/B, where B includes all 
the EW and QCD backgrounds. Note that we may choose different pt cuts for the hadronic 
W and the leptonic W, as well as different cuts for the leading and the next-to-leading 
forward/backward jet. The BDT efficiency for each px bin can also be chosen independently. 
Allowing more cuts to be variable, we may obtain better results than presented in this article. 

For the Higgsless model, the best cuts are found to be: 

p^^^^ = 350 GeV, e^°^ = 0.54, p^_^^, = 70 GeV, E.^^^ = 340 GeV, Mvy^/.cut = 850 GeV. 

(12) 
We see that the optimal central jet veto cut is rather mild: p^^ut ~ '^^ GeV. This cut was 
mostly designed to reduce the ti background because it contains extra jet activities in the 
central region. In our approach, this cut is partially redundant to the W tagging method, 
where a W jet candidate is required to have few extra hadronic activities, as expected for a 
jet originating from a color singlet particle. 

The resultant Mww distributions (before the final Mww cut) for the signal and the 
backgrounds are shown in Fig. |4} After the Mw^^ > 850 GeV cut, the cross section for the 
signal is 0.49 fb and for the total backgrounds 0.24 fb. With 40 fb"^ data at the 14 TeV 
LHC, we can discover a Higgsless model at the 5 a level (Poisson statistics is used for small 
numbers of signal and background events), while the discovery potential for the SILH model 
is scaled down by a factor of {chC,Y- This means that, the LHC at the high luminosity 
(LHC-HL) run with ~ 3000 fb^^ data can lead to the discovery of an SILH model with 
ch^ ~ 0.3, corresponding to a ~ 15% deviation from the SM hWW coupling. Note that in 
the above estimates, we have not included systematic uncertainties, which are important, 
particularly for small S/B ratios. For example, when chC, ~ 0.3, S/B ^ 0.2, therefore the 
systematic uncertainties on the backgrounds have to be reduced to the level of a few percent 
to make a discovery. A dedicated study is needed to understand whether this is achievable. 
Here, we speculate on methods for eliminating some of the systematic uncertainties. The 
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QCD backgrounds have large theoretical uncertainties in the cross sections, however, most 
of them can be eliminated by using data-driven methods when large statistics is available. 
For example, we may use b-rich tt events, in which signal events are rare, as a control sample 
to estimate the ti background in the signal sample. To estimate the size of the W^ + jet 
background, the control sample can be chosen as events with very similar kinematics, but 
with the W candidate jet mass just outside the W mass window. A data driven method 
is perhaps unavailable for the SM EW background, which contains real VBF events. The 
current theoretical calculations for the cross section of a 1 TeV Higgs in the VBF channel have 
< 10% uncertainties [2S], which needs to be refined to make it directly usable. However, one 
may also examine the polarization fractions instead of the total cross sections to reduce the 
systematic uncertainties, following Ref. [^. Moreover, we can also achieve a larger S/B by 
slightly sacrificing the significance. For example, by setting p^^^^^ = 440 GeV, s^^^ = 0.36, 
p^^^^^^ = 90 GeV, E,^t = 400 GeV and Mww,cnt = 1200 GeV, we obtain as = 0.2 fb and 
o"B = 0.043 fb, where S/B is increased by more than 100% and S/yB only decreases by 
4%. 

If the systematic uncertainties are under control, in terms of the sensitivity to the anoma- 
lous hWW coupling, at 1 a the 3000 fb^^ run has a sensitivity to cui ~ 0.15, i.e., a ~ 8% 
deviation from the SM coupling. Taking c^ ~ 1, this implies a sensitivity for the composite 
scale A ~ 47r/ up to ~ 8 TeV. In comparison, fitting based on precision measurements at 
the Higgs resonance offers a very different path for probing a possible anomalous hWW 
coupling. There the 1 a error bar is estimated to be ~ 5% of the SM prediction at the 
LHC-HL [29H3T] . dominated by systematic uncertainties. Though our results are not with- 
out systematic uncertainties, they are from completely different sources. In particular, at 
the Higgs pole, all Higgs production and decay channels are entangled and have to be fit 
together, while WW scattering is a direct probe to the hWW coupling, insensitive to un- 
certainties such as invisible/exotic Higgs decays. Therefore, we see that assisted by the jet 
substructure techniques, WW scattering in the semileptonic channel can be a complement 
to the Higgs pole measurement of the hWW coupling. 
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Mh = 125GeV 
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tt 
PT > lOOGeV 


WW{+QCD jets) 
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ax BR (fb) 
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isolated lepton 
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64.8 
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43.2k 
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leptonic W reconstruction 
p^^^^ > 350 GeV 
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24.4 


^P^->ii,mtcring ^ 35Q Q^y 


3.31 
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23.6 
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8.82 


1^-jet tagging, e^^'^ = 0.54 


2.08 


0.81 


5.38 


13.8 


5.43 


tagging jets found 
E > 340 GeV 


0.73 


0.14 


0.16 


0.14 


0.02 


p^^ < 70 GeV 


0.63 


0.12 


0.06 


0.09 


0.01 


mww > 850 GeV 


0.59 


0.11 


0.04 


0.07 


0.01 




5 = H] 


ggsless - SM = 0.49 fb, B = 0.24 fb 





TABLE 2: Step by step cross sections after each cut. The cross section of tt is multiphed by a 
factor of 0.25 to account for b-jet veto effect. 

C. Comparison to other analyses 



Dileptonic W decay channels, especially from same sign W^s, are usually considered as 
"gold plated" due to their lower backgrounds compared to the semi- leptonic channel. We 
compare our results to the most recent results for the dileptonic channels such as in Ref. |13] . 
The authors of Ref. |^ showed the important difference in the W pt distribution between 
the SM and the Higgsless model, although in practice lepton pt is used as the discriminator 
because the Ws^ momenta are not reconstructable in the dileptonic channel. By utilizing the 
difference in the lepton pt distribution they obtained better results than previous analyses, 
for example, [12]. In particular, the best performance comes from the same sign dileptonic 
channel. With 100 fb^^, they obtain S/B ^ 17/3 from their jjWW simulations, excluding 
tt background. Their estimate for the tt background is about 1 event per 100 fb~^, where 
the same sign lepton comes from a b meson decay. Therefore, their final best result is 
S/B ^ 17/4 for the same sign dileptonic channel. Note that adding the opposite sign 
dileptonic channel only slightly changes the statistical significance because the contribution 
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from ti to opposite sign leptons is much larger. 

In comparison, our final result is S/B ^ 49/24, which is ~ 20% better in S/y/B,^ and 
~ 200% more signal events are retained. This result is obtained by scanning the cuts 
to maximize the significance. Although the signal-background ratio is smaller than the 
dileptonic channel, if needed, one may choose the cuts to increase S/B without sacrificing 
significantly the significance. As mentioned in the previous subsection, by using different 
cuts, we obtain S/B ^ 20/4.3 for 100 fb~^. This S/B is similar to the dileptonic channel, 
and S/yB is only slightly smaller than the optimal result. 

Other works on semileptonic WW scattering include Ref. [TU], where the jet substructure 
method was proposed. A direct comparison to Ref. [10] is impossible because they focused 
on models with resonances which can greatly enhance the signal and consequently they 
simplified the signal definition by including all WW — )■ WW events instead of using the 
subtraction scheme we employed. Nevertheless, we have incorporated the filtering/mass 
drop method [TT] as an essential step in our W jet tagging method, which is an improved 
version of the ?/-splitter method in Ref. [TU]. For comparison, we did a check by turning 
off the MVA W jet tagging method and using filtering alone to identify the W jets. Then 
we repeated the optimizing procedure by scanning over the essential cuts, and obtained the 
best significance ~ 6.7 per 100 fb~^ {S = 40, B = 25) which is ~ 16% lower than the 
outcome using MVA W jet tagging. A more recent parton-level analysis in the semileptonic 
channel is given in Ref. [13]. However, the jet substructure method was not used in that 
analysis, consequently a much smaller S/B ratio was obtained which led to pessimism for 
this channel. 

4. CONCLUSION 

In this article, we have demonstrated that the semileptonic channel can play a major role 
in extracting new physics from high energy WW scattering at the LHC In particular, we 
have considered the strongly interacting light Higgs scenario as a new physics example that 
induces an excess in WlWl — )■ WlWl scattering, compared to the prediction of the SM 



^ When the number of events is small such as at 100 fb ^, the actual improvement in significance is even 
larger due to the deviation from Gaussian statistics. 
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FIG. 4: Reconstructed Mww for the signal and the backgrounds. The sohd curve indicates the 
Higgsless signal. The dotted (dashed) curve denotes the SILH signal with (c//^)^ = 0.5 ((c/f^)^ = 
0.2). 



with a 125 GeV Higgs. In practice the signal of this model can be approximated based on 
that of the Higgsless model by a simple rescaling. 

Compared to the traditionally "gold-plated" dileptonic channel, the semileptonic channel 
is blessed with a larger branching ratio and the possibility to reconstruct important kine- 
matics, although it bears larger backgrounds. Useful schemes such as imposing tagging jets 
cuts and central jet veto have been found effective for reducing QCD backgrounds, yet still 
not enough to make the semileptonic channel competitive to the dileptonic channel. We 
have shown that, assisted by the recently developed jet substructure techniques, in particu- 
lar the multivariate W jet tagging method [6], we can further greatly reduce the persisting 
backgrounds from I^+jets and tt+jets, while keeping a significant amount of signals. The 
best statistical significance {S/yB) we have achieved is about 20% better than the best ex- 
isting result for the dileptonic channel. Meanwhile, more than ~ 200% of the signal events 
are retained, which can benefit further investigations on properties of the new physics. Al- 
though the signal-background ratio is moderately smaller than the dileptonic channel, a 
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similar ratio can be obtained by slightly sacrificing the significance (yet still larger than that 
of the dileptonic channel). These results make the semileptonic channel as good as, or even 
more promising than the dileptonic channel. Applying the method to the SILH models, we 
found that the signal can be discovered at the 5o" level if its strength is 100% (10%) of a 
pure Higgsless model, using about 40 (3000) fb^^ data at the 14 TeV LHC. In terms of the 
sensitivity to the anomalous Higgs-W^ boson coupling, we can reach ~8% at the la level. 
WW scattering is therefore a good complement to the precision measurements for probing 
hWW coupling, while having less sensitivity to uncertainties such as from invisible/exotic 
Higgs decays. In addition, our method is also expected to boost the discovery opportunity 
for heavy resonances responsible for unitarizing WW scattering in the SILH models. 

Meanwhile we would like to comment that the analysis in this article is not comprehensive 
and can be further elaborated by including subleading effects and uncertainties, such as: 
other channels, e.g. WZ and ZZ scattering; the cross sections at next-to-leading order; the 
effect of pileup; experimental uncertainties beyond simple geometrical acceptance cuts and 
hadronic calorimeter resolution, etc. Nevertheless, we do not expect these to change our 
conclusion, i.e., with the jet substructure techniques, the semileptonic channel is among the 
best for WW scattering studies and more efforts, both theoretical and experimental, should 
be devoted to this channel. We hope our work could shed new light on finding new physics 
from WW scattering: an important yet conventionally challenging process. 
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Appendix A: W jet tagging 

As mentioned in the introduction, two major differences make a W jet distinguishable 
from a QCD jet. Firstly, a W jet contains two sub-regions where the jet energy is concen- 
trated, originating from the two quarks from the W decay. These sub-regions are usually 



18 



called subjets. Subjets can be conveniently identified by using a recombination jet clustering 
algorithm: one first finds the constituents of the original fat jet which is obtained with a 
jet radius -Rfat, then uses a smaller R < _Rfat to recluster the constituents, yielding more 
than one subjets. Usually, the two leading subjets in a VT jet correspond to the two partons 
from the W decay, and have a balanced momentum configuration. On the other hand, QCD 
splitting tends to produce subjets with hierarchical momenta. Therefore, a QCD jet usually 
contains one and only one hard subjet, together with a set of soft subjets. In jet grooming 
algorithms [HI [121 HZ], one takes a step further by discarding the soft subjets and keeping 
only a few hard subjets in the "groomed jet" . After grooming, the mass of the W jet does 
not change significantly, which is still around the W mass, while that of a QCD jet is often 
shifted to a small value. One can then use a W mass window cut to eliminate most of the 
QCD jets. 

Due to its effectiveness, the first step in our W tagging method described in Ref. [6] is 
to use the filtering/mass drop algorithm [TT] to identify W jet candidates. In particular, we 
start from fat jets clustered with R = 1.2 using the Cambridge/Aachen jet algorithm. We 
then follow the mass drop algorithm in Ref. [llj to find a smaller subjet radius, -Rfiit, and use 
it to recluster the fat jet into subjets. We keep the leading three subjets as the new, filtered 
jet. The jet mass after filtering is required to be within the mass window of (60, 100) GeV. 
The filtering/mass drop parameters are optimized to maximize the significance, S/y/B, 
where S is the number of W jets and B QCD jets. We did this for different jet p-r' bins (in 
50 GeV steps) separately from 200 GeV to 1 TeV and found a factor of ~ 2 improvement 
in S/yB for all pr's considered. 

Despite the success of the jet grooming algorithms, a good amount of QCD background 
jets still remain. This is because, although rare, hard splitting does happen in QCD jets 
which can mimic a W jet more closely. In order to further eliminate QCD backgrounds, we 
make use of the difference in radiation patterns between W jets and QCD jets, as a result of 
their different color structures. VT is a color singlet particle, therefore, when highly boosted, 
(almost) all radiation from its decay is contained in a small cone around the W momentum. 
This is different from a QCD jet, which is initiated from a colored particle with radiation 
more diffusely distributed. This difference is visible in jet shape variables such as planar flow 
[27[ and N-subjettiness [28j. In Ref. [H], we proposed a set of simple but powerful variables 
which we dubbed mass and px R — cores. These mass (px) R — cores are defined as the 
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ratios between the original jet mass (pr) and the leading subjet mass (pr) reclustered with 
a set of smaller i?'s. The R — cores measure how the masses or p^'s grow with an increasing 
R, which are sensitive to how the radiation is distributed. 

None of the above variables alone can account for all differences between W jets and 
QCD jets, therefore, the most powerful way to use the variables is to combine them in a 
multivariate tagging algorithm. In Ref. [6J, we selected 25 most useful variables and com- 
bined them using the Boosted Decision Trees method. These variables include the masses 
and pt's after jet grooming, planar flows, px R — cores, etc.. The multivariate method was 
applied to jet samples that have passed the filtered mass window cut and another factor of 
~ 2 improvement is achieved in S/vB on top of the jet grooming algorithms. One may 
also simplify the method by including fewer variables and achieve improvement nearly as 
good. For example, we selected a set of 7 variables which give a result about 25% worse than 
using the full set. One may also include other variables such as N-subjettiness [28], charged 
particle multiplicity [23] and volatility [32]. However, adding more variables are not likely 
to significantly improve the W tagging performance because they contain redundant infor- 
mation (e.g. we have checked by including N-subjettiness and charged particle multiplicity). 
In this article, we use the method in Ref. [6] with the full set of 25 variables. 
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